A Two-pass Strategy for Handli Vocabulary Recognit

نویسنده

  • Odette Scharenborg
چکیده

This paper addresses the issue of large-vocabulary recognition in a specific word class. We propose a two-pass strategy in which only major cities are explicitly represented in the first stage lexicon. An unknown word model encoded as a phone loop is used to detect OOV city names (referred to as rare city names). After which SpeM, a tool that can extract words and word-initial cohorts from phone graphs on the basis of a large fallback lexicon, provides an N-best list of promising city names on the basis of the phone sequences generated in the first stage. This N-best list is then inserted into the second stage lexicon for a subsequent recognition pass. Experiments were conducted on a set of spontaneous telephone-quality utterances each containing one rare city name. We tested the size of the N-best list and three types of language models (LMs). The experiments showed that SpeM was able to include nearly 85% of the correct city names into an N-best list of 3000 city names when a unigram LM, which also boosted the unigram scores of a city name in a given state, was used.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Concept Mapping on Iranian EFL Learners’ Vocabulary Learning and Strategy Use

This study aimed to investigate the effects of concept mapping on the extent to which Iranian EFL learners retain new vocabularies and the degree of awareness toward vocabulary learning strategies they tended to use. To this end, a total of 40 Iranian EFL students were asked to participate in this study. They were randomly assigned to two equal groups; namely, experimental and control. The part...

متن کامل

The Effect of Emotionality and Openness to Experience on Vocabulary Learning Strategies of Iranian EFL Students

This study explored the relationship between vocabulary learning strategies and learner variables of Iranian learners of English as a foreign Language (EFL) with special reference to their personality types to examine what implications these associations have for teaching EFL. It tried to find any possible relation between vocabulary learning strategies use of Iranian EFL students and two perso...

متن کامل

Investigating the Effects of Rote and Contextualized Memorization on Iranian Elementary EFL Learners’ Vocabulary Development

It is obvious that vocabulary lies in the center of language learning and communication. This issue shows that vocabulary has a vital role in mastering all the skills of a language. Vocabulary Learning Strategies (VLS) facilitate the process of learning lexical items. The present study investigated the role of two specific strategy types in vocabulary learning, including rote and contextualized...

متن کامل

Microsoft Word - A New Language Model For Automatic Arabic Speech Recognit¡¦

A new language model for Arabic language for large vocabulary automatic speech recognition (ASR) is introduced. The derivative future of the Arabic word is quite useful in dividing the process into two phases. In phase-1 the fixed words, the prefix, the suffix and the form of the derivative words are determined through phase-1M-gram, of course, given the acoustical data. In phase 2 another M-gr...

متن کامل

Novel two-pass search strategy using time-asynchronous shortest-first second-pass beam search

In this paper, we describe a novel two-pass search strategy for large vocabulary continuous speech recognition. The first-pass of this strategy uses a regular time-synchronous beam search with rough models to generate a word lattice. Then, the second-pass search derives exact results from the word lattice using more accurate models. This search is “time-asynchronous shortest-first beam search”,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005